Stylistic Variation in an Information Retrieval Experiment

Author

  • Jussi Karlgren
Abstract

Texts exhibit considerable stylistic variation. This paper reports an experiment in which a corpus of documents (N = 75 000) is analyzed using various simple stylistic metrics. A subset of the corpus (n = 1000) has previously been assessed as relevant for answering given information retrieval queries. The experiment shows that this subset differs significantly from the rest of the corpus in terms of the stylistic metrics studied.
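The kind of comparison the abstract describes can be illustrated with a minimal sketch. The abstract does not name the metrics or the significance test used, so everything here is an assumption chosen for illustration: the three metrics (average word length, average sentence length, type-token ratio), the regular expressions used for tokenization, the choice of Welch's t statistic, and the function names `stylistic_metrics`, `welch_t`, and `compare_groups`.

```python
import math
import re
import statistics


def stylistic_metrics(text):
    """Return a few simple, illustrative style metrics for one text.

    These metrics are assumptions for the sketch, not the paper's own set.
    """
    words = re.findall(r"[A-Za-z']+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not words or not sentences:
        return None  # nothing measurable in this text
    return {
        "avg_word_length": sum(len(w) for w in words) / len(words),
        "avg_sentence_length": len(words) / len(sentences),
        "type_token_ratio": len({w.lower() for w in words}) / len(words),
    }


def welch_t(sample_a, sample_b):
    """Welch's t statistic for two samples with possibly unequal variances.

    Each sample needs at least two values.
    """
    mean_a, mean_b = statistics.mean(sample_a), statistics.mean(sample_b)
    var_a, var_b = statistics.variance(sample_a), statistics.variance(sample_b)
    standard_error = math.sqrt(var_a / len(sample_a) + var_b / len(sample_b))
    if standard_error == 0:
        raise ValueError("both samples have zero variance")
    return (mean_a - mean_b) / standard_error


def compare_groups(relevant_texts, other_texts, metric):
    """Compare one metric between the relevant subset and the other texts."""
    def values(texts):
        scored = (stylistic_metrics(t) for t in texts)
        return [m[metric] for m in scored if m is not None]

    return welch_t(values(relevant_texts), values(other_texts))
```

A large absolute value of the statistic for a metric would suggest that the relevant subset differs from the rest of the corpus on that metric; turning it into a p-value would additionally require the Welch-Satterthwaite degrees of freedom, which this sketch leaves out.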


Similar resources

Stylistic Experiments for Information Retrieval (Jussi Karlgren, Swedish Institute of Computer Science)

Information retrieval systems are built to handle texts as topical items: texts are tabulated by occurrence frequencies of content words in them, under the assumption that text topic is reasonably well modeled by content word occurrence. But texts have several interesting characteristics beyond topic. The experiments described in this text investigate stylistic variation. Roughly put, style is ...


Stylistic Experiments in Information Retrieval

A discussion of various experiments to utilize stylistic variation among texts for information retrieval purposes. 1. Stylistics. Texts vary in many ways. Authors make choices when they write a text: they decide how to organize the material they have planned to introduce; they make choices between synonyms and syntactic constructions; they choose an intended audience for the text. Authors will m...


Visualizing Stylistic Variation

Texts vary not only by topic, but by style; indeed, often the variation between texts ‘about the same thing’ can be just as noticeable as the variation between texts ‘about different things’. Some facets of this variation are quite easy to detect, and quite predictable when applied to categorization of texts by genre, functional style, or tentatively quality. Making use of such variation in an ...


Textual Stylistic Variation: Choices, Genres and Individuals

This chapter argues for more informed target metrics for the statistical processing of stylistic variation in text collections. Much as operationalized relevance proved a useful goal to strive for in information retrieval, research in textual stylistics, whether application oriented or philologically inclined, needs goals formulated in terms of pertinence, relevance, and utility, notions that a...


Stylistic Analysis Of Text For Information Access

Papers from the workshop held in conjunction with the 28th Annual International ACM Conference on Research and Development in Information Retrieval, August 13-19, 2005, Salvador, Bahia, Brazil


Journal:
  • CoRR

Volume: cmp-lg/9608003

Publication date: 1994